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GGCAGAACGGGCATAGTGGAACGGACCGTGTGGTATGCTCCGCGATGTCTGAGGCGACTTGGTG 

GTACCGAGGAGGGACTTCAAAACATGACCTGCATTACAGAAGGGAAGCGGAGGTTAACACCACA 

CTCGAGGAGTTGTTACTCTATTTTATTTTCTTAATAAATCTATGCATATTGACTTTTGGGATGG 

TCAATCCACATATGTACTATTTAAACAAAGTTATGTCGTCTCTGTTTGTGGACACTTCTCTACC 

TGATGATGAAAGAAGCAGCTTTAGGTCCATTCGGAGCATAACTGAGTTTTGGAAGTTCATGGAA 

GGACCCCTCATCGACGGCTTGTACTGGGACTCGTGGTATGGCACCAAACAGCTGTACAGTGTGA 

AGAACAGCAGCCGCATCTACTACGAGAACGTTCTTCTCGGCATCCCCAGAGTGCGGCAACTGCG 

AGTCCGAAACAACACTTGCAAGGTCTACCCAGCTTTCCAGTCCCTGGTCAGCGACTGCTACAGC 

AAGTACACAGTGGAAAACGAAGACTTCTCTGATTTTGGCCTCAAACGCAATCCAGAATGGACGC 

ACACGCCTTCTTCCCGCACTGCCCCATGGCACTGGGGGTTTGTTGGCGTATACCGAGATGGAGG 

ATATATAGTCACGTTATCAAAATCAAAATCTGAAACCAAAGCCAAATTTGTTGACCTTCGACTG 

AACAACTGGATTAGCAGAGGCACCAGGGCTGTTTTTATTGATTTCTCCCTGTACAATGCTAATG 

TCAACCTGTTTTGCATCATCAGGCTGCTGGCAGAGTTCCCTGCGACGGGTGGGCTCCTCACCTC 

CTGGCAGTTCTACTCTGTGAAGCTCCTCAGATACGTCTCCTACTACGACTACTTCATTGCCTCC 

TGTGAAGTCATATTTTGTATTTTTCTCTTTGTCTTCATAATACAAGAACTGAGGAAAGTGAACG 

AGTTTAAGTCTGCCTATTTCAGAAGTGTCTGGAACTGGCTGGAGATGCTGCTCCTGCTGCTCTG 

TTTTCTCGCCGTGTCTTTCTATGCATACTGTAACATGCAGAGCTTTCTCTTGCTTGGACAGCTG 

CTGAAAAACACTGACAGCTATCCCGACTTTTACTTCCTTGCATACTGGCACATTTACTATAACA 

ACGTAATTGCTATCACTATCTTCTTTGCATGGATAAAGATATTCAAGTTCATAAGCTTCAATGA 

GACAATGTCGCAGCTGTCATCAACACTCTCCCGCTGCATGAAGGACATCGTGGGGTTCGCCATC 

ATGTTCTTCATCATCTTCTCTGCTTATGCCCAGTTGGGATTTCTGGTTTTTGGGTCACAGGTTG 

ATGATTTTTCAACTTTTCAAAATTCCATATTTGCACAATTTCGAATTGTCCTCGGGGACTTTAA 

CTTTGCTGGCATCCAGCAGGCCAACTGGATCTTGGGGCCCATCTACTTCATCACGTTCATCTTC 

TTTGTGTTCTTTGTGCTCCTGAACATGTTCTTGGCAATAATTAATGACACCTATTCTGAAGTTA 

AGGCTGATTATTCAATAGGCAGAAGACCAGATTTTGAACTTGGTAAAATAATTCAAAAGAGTTG 

CTTTAATGTTCTCGAGAAACTCAGACTCAAGAAAGCTCAAGCTAAAGAAGAAAAGAAAATGCAA 

ACCACTGACTTGGCCCAGAGAGCCAGAAGAGAAGGCTTTGATGAAAGTGAGATCCAAGAGGCAG 

AGCAGATGAAAAGATGGAAGGAAAGGCTTGAAAAAAAGTATTATTCTACAGAAATTCAAGACGA 

TTATCAGCCTGTCACTCAGCAAGAATTCCGAGAGCTCTTTTTATACGCGGTGGAGCTTGAGAAG 

GAATTACACTATGTCAGTTTAAAACTGAACCAACTGATGAGAAAGCTGCACTAGCAGGCTGACA 

AGTGGAGTCATTTTATAAGAAAAGGCAACCGAAGAATTTCATTCAGTATGAAGATTTTTTCCTC 

TCAATTTTCTTCAGTAATGCAAAAGAAAAACCAAAAAGTAACCAGAAGTGCTTTTTATTTCAAA 

GTTCTTGAAGTAAAAGAGTAAAACTCTTGTCCTTTGCTAACAGCCGTGTCTGCAGTAAAACAAT 

GAAGGAGCCTGCGTGTTTCCTAAGTGTGGAGAGGATCTGCGGGAATGTGGAACAGCTTTCCTTG 

CCTACTGGAACCACAAACAAGCACACAATGGGACTCTCTGAGTGCCTGACAAAGTGAACGCAAG 

TACAGCCAAGCACACATGGTGAACTGTCAGGGAACACAAGCACTTTATGGCGTCAACTTTCAAG 

GAACATATTTTATATGGATTTTGAAGAGTCTTGTTTGCTGATAAGAACTTCAAGAAGTCTAAGC 

TTGGCTTTGATTCTCTTGTATTCCTTATATTCCTCAAGCACCGGAACACGATCCTCCTTCTGGG 

CATTCCTAGGGAAGATAAAACTCTGTAAAGCAAAAAAAAGAAAAAAAAAAAAAA (SEQ ID 

NO: 1) 
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MSEATWWYRGGTSKHDLHYRREAEVNTTLEELLLYFIFLINLCILTFGMVNPHMYYLNKVMSSL 
FVDTS LPDDERS S FRS I RS I TEFWKFMEGPL I DGL YWDS WYGTKQLYS VKNS SRI Y YENVLLG I 
PRVRQLRVRNNTCKVYPAFQSLVSDCYSKYTVENEDFSDFGLKRNPEWTHTPSSRTAPWHWGFV 
GVYRDGGYIVTLSKSKSETKAKFVDLRLNNWISRGTRAVFIDFSLYNANVNLFCIIRLLAEFPA 
TGGLLTSWQFYSVKLLRYVSYYDYFIASCEVIFCIFLFVFIIQELRKVNEFKSAYFRSVWNWLE 
MLLLLLCFLAVSFYAYCNMQSFLLLGQLLKNTDSYPDFYFLAYWHIYYNNVIAITIFFAWIKIF 
KFISFNETMSQLSSTLSRCMKDIVGFAIMFFIIFSAYAQLGFLVFGSQVDDFSTFQNSIFAQFR 
IVLGDFNFAGIQQANWILGPIYFITFIFFVFFVLLNMFLAIINDTYSEVKADYSIGRRPDFELG 
KIIQKSCFNVLEKLRLKKAQAKEEKKMQTTDLAQRARREGFDESEIQEAEQMKRWKERLEKKYY 
STEIQDDYQPVTQQEFRELFLYAVELEKELHYVSLKLNQLMRKLH (SEQ ID NO: 2) 
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underlined = deleted in targeting construct 
BOLD = sequence flanking Neo insert in targeting construct. 
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GGCAGAACGGGCATAGTGGAACGGACCGTGTGGTATQCTCCGCGATGTCTGAGGCGACTT 
GGTGGTACCGAGGAGGGACTTCAAAACATGACCTGQATTACAGAAGGGAAGCGGAGGTTA 
ACACCACACTCGAGGAGTTGTTACTCTATTTTAT'PTTCTTAATAAATCTATGCATATTGA 
CTTTTGGGATGGTCAATCCACATATGTACT ATTTAAACAAAGTTATGTCGTCTCTGTTTG 

tggacacttctctacctgatgatgaaagaagcagctttaggtccattcggagcataactg 
agttttggaagttcatggaaggacccctcatcj5acggcttgtactgggactcgtg gtatg 
gcaccaaacagctgtacagtgtgaagaacagfcagccgcatctactacgagaacgttcttc 
tcggcatccccagagtgcggcaactgcgagtrccgaaacaacacttgcaaggtctacccag 
ctttccagtccctggtcagcgactgctacmcaagtacacagtggaaaacgaagacttct 
ctgattttggcctcaaacgcaatccagaotggacgcacacgccttcttcccgcactgccc 
catggcactgggggtttgttggcgtatmtcgagatggaggatatatagtcacgttatcaa 
aatcaaaatctgaaaccaaagccaaa'ottgttgaccttcgactgaacaactggattagca 
gaggcaccagggctgtttttattgat/ttctccctgtacaatgctaatgtcaacctgtttt 
gcatcatcaggctgctggcagagtt/cctgcgacgggtgggctcctcacctcctggcagt 
tctactctgtgaagctcctcagatacgtctcctactacgactacttcattgcctcctgtg 

^TTGTCTTCATAATACAAGAACTGAGGAAAGTGAACG 
AGTTTAAGTCTGCCTATTTCAGA^GTGTCTGGAACTGGCTGGAGATGCTGCTCCTGCTGC 
TCTGTTTTCTCGCCGTGTCTTTCTATGCATACTGTAACATGCAGAGCTTTCTCTTGCTTG 
GACAGCTGCTGAAAAACACTGACAGCTATCCCGACTTTTACTTCCTTGCATACTGGCACA 
TTTACTATAACAACGTAATT^CTATCACTATCTTCTTTGCATGGATAAAGATATTCAAGT 

\TGTCGCAGCTGTCATCAACACTCTCCCGCTGCATGAAGG 
ACATCGTGGGGTTCGCCAyCATGTTCTTCATCATCTTCTCTGCTTATGCCCAGTTGGGAT 
TTCTGGTTTTTGGGTCAOAGGTTGATGATTTTTCAACTTTTCAAAATTCCATATTTGCAC 
AATTTCGAATTGTCCTGGGGGACTTTAACTTTGCTGGCATCCAGCAGGCCAACTGGATCT 
TGGGGCCCATCTACTT|CATCACGTTCATCTTCTTTGTGTTCTTTGTGCTCCTGAACATGT 
TCTTGGCAATAATTAATGACACCTATTCTGAAGTTAAGGCTGATTATTCAATAGGCAGAA 
?TTGGTAAAATAATTCAAAAGAGTTGCTTTAATGTTCTCGAGAAAC 
TCAGACTCAAGAAAfeCTCAAGCTAAAGAAGAAAAGAAAATGCAAACCACTGACTTGGCCC 
AGAGAGCCAGAAGAGAAGGCTTTGATGAAAGTGAGATCCAAGAGGCAGAGCAGATGAAAA 
GATGGAAGGAAAjSGCTTGAAAAAAAGTATTATTCTACAGAAATTCAAGACGATTATCAGC 
tAAGAATTCCGAGAGCTCTTTTTATACGCGGTGGAGCTTGAGAAGGAAT 
^AGTTTAAAACTGAACCAACTGATGAGAAAGCTGCACTAGCAGGCTGACA 
AG TG GAG TCAfr TTTATAAGAAAAGGCAAC CG AAGAATTTCATTCAGTATGAAGATTTTTT 
hcTCTTCAGTAATGCAAAAGAAAAACCAAAAAGTAACCAGAAGTGCTTTTT 
STTCTTGAAGTAAAAGAGTAAAACTCTTGTCCTTTGCTAACAGCCGTGTCTG 
CAGTAAA^CAATGAAGGAGCCTGCGTGTTTCCTAAGTGTGGAGAGGATCTGCGGGAATGT 
GGAACAQCTTTCCTTGCCTACTGGAACCACAAACAAGCACACAATGGGACTCTCTGAGTG 
\GTGAACGCAAGTACAGCCAAGCACACATGGTGAACTGTCAGGGAACACAAG 
CACTTjfATGGCGTCAACTTTCAAGGAACATATTTTATATGGATTTTGAAGAGTCTTGTTT 
rAAGAACTTCAAGAAGTCTAAGCTTGGCTTTGATTCTCTTGTATTCCTTATATTC 
CTCA^GCACCGGAACACGATCCTCCTTCTGGGCATTCCTAGGGAAGATAAAACTCTGTAA 
AGC&AAAAAAAGAAAAAAAAAAAAAA 



CTGTCACTCAG? 
TACACTATGTG 



CCTCTCAAT/I 
ATTTCAAAC 
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Gene Sequence Structure 



211 bp 



Sequence Deleted 
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Size of full-length 
cDNA: 2486 bp 



Targeting Vector* 
(genomic sequence) 



Arm Length: 
5': 2.5 kb 
3': 1 kb 



Targeting Vector 

----- Endogenous Locus 

* Not drawn to scale 



5 f arm 



LacZ-Neo 
Cassette 



3' arm 




5 " >CATGCATACATGGGTGCACGT 
GCGTGCGTGCAAACACCCACACAC 
AGATAGATAAATGTAAAAAAATAC 
TTAGGCAATGCTGTTTTTTTTAAG 
CTTGGGAAATTAATATTTTTGAAC 
ACTGCTGATGTGGGATTTTAAGTT 
ACAAATGTTAAACTTGTTCTCCTT 
AGTGACTTTTGGGATGGTCAATCC 
ACATATGTACT< 3 1 
(SEQ ID NO: 3) 



5 1 >GTATGGCAACAAACAGCTGTA 
CAGTGTGAAGAACAGCAGCCGCAT 
CTACTACGAGAACGTTCTTCTCGG 
CATCCCCAGAGTGCGGCAACTGCG 
AGTCCGAAACAACACTTGCAAGGT 
CTACCCAGCTTTCCAGTCCCTGGT 
CAGCGACTGCTACAGCAAGTACAC 
AGTGGAAAACGAAGACTTCTCTGA 
TTTTGGCCTCA< 3 ■ 
(SEQ ID NO: 4) 
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